Adaptive XML Shredding: Architecture, Implementation, and Challenges

نویسندگان

  • Juliana Freire
  • Jérôme Siméon
چکیده

As XML data becomes central to business-critical applications, there is a growing need for efficient and reliable XML storage. Two main approaches have been proposed for storing XML data: native and colonial systems. Native systems (e.g., [9, 20]) are designed from the ground up specifically for XML and XML query languages. Colonial systems (e.g., [5, 7, 19]), on the other hand, attempt to reuse existing commercial database systems (DBMS) by mapping XML into the underlying model used by the DBMS. Colonial systems can thus leverage features, such as concurrency control, crash recovery, scalability, and highly optimized query processors available in the DMBS, making them an attractive alternative for managing XML data. However, several technical challenges need to be addressed in terms of architecture, algorithms, and implementation of these systems. In this paper, we described how these issues are addressed in the context of colonial systems that use relational databases as the underlying DBMS. The mismatch between the XML and the relational models implies that one must first shred an XML tree-structured document so that it fits into flat relational tables. Therefore, a mechanism is needed to determine the appropriate storage configuration. Once a mapping is selected, the system must provide support for loading the XML data into the database, and to translate queries over the original document into queries over the mapped data. There are different approaches for these problems. For example, while commercial relational systems require users to manually define mappings [14, 15], techniques have been proposed to automatically derive XML-to-relational mappings that adopt either a fixed shredding strategy [19, 11] or that derive the best shredding for a given application [5, 4]. Different techniques have also been proposed for query translation [10, 6]. Although individual problems pertaining to colonial XML storage systems have been studied in isolation, to the best of our knowledge, the design and implementation of a complete colonial system has not been described in the literature. In this paper, we discuss the design and implementation of LegoDB [5], a colonial XML data management system. In particular, we present the complete

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Apply Uncertainty in Document-Oriented Database (MongoDB) Using F-XML

As moving to big data world where data is increasing in unstructured way with high velocity, there is a need of data-store to store this bundle amount of data. Traditionally, relational databases are used which are now not compatible to handle this large amount of data, so it is needed to move on to non-relational data-stores. In the current study, we have proposed an extension of the Mongo...

متن کامل

Apply Uncertainty in Document-Oriented Database (MongoDB) Using F-XML

As moving to big data world where data is increasing in unstructured way with high velocity, there is a need of data-store to store this bundle amount of data. Traditionally, relational databases are used which are now not compatible to handle this large amount of data, so it is needed to move on to non-relational data-stores. In the current study, we have proposed an extension of the Mongo...

متن کامل

A Multiple Adaptive Neuro-Fuzzy Inference System for Predicting ERP Implementation Success

The implementation of modern ERP solutions has introduced tremendous opportunities as well as challenges into the realm of intensely competent businesses. The ERP implementation phase is a very costly and time-consuming process. The failure of the implementation may result in the entire business to fail or to become incompetent. This fact along with the complexity of data streams has led ...

متن کامل

ShreX: Managing XML Documents in Relational Databases

We describe ShreX, a freely-available system for shredding, loading and querying XML documents in relational databases. ShreX supports all mapping strategies proposed in the literature as well as strategies available in commercial RDBMSs. It provides generic (mapping-independent) functions for loading shredded documents into relations and for translating XML queries into SQL. ShreX is portable ...

متن کامل

Querying and Updating XML with XML Schema constraints in an RDBMS

The increasing need of a variety of applications to store and process XML data has led to the development of systems and techniques for XML storage and querying. XML updating hasn’t received a corresponding amount of attention. We discuss XPURS, a system of processing XPath queries and updates on XML Schema-compliant XML data. XPURS updates respect XML ordering and XML Schema typing constraints...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002